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Abstract —Subspace clustering is an important problem in machine learning with many applications in computer vision and pattern 
recognition. Prior work has studied this problem using algebraic, iterative, statistical, low-rank and sparse representation techniques. 
While these methods have been applied to both linear and affine subspaces, theoretical results have only been established in the case 
of linear subspaces. For example, algebraic subspace clustering (ASC) is guaranteed to provide the correct clustering when the data 
points are in general position and the union of subspaces is transversal. In this paper we study in a rigorous fashion the properties of 
ASC in the case of affine subspaces. Using notions from algebraic geometry, we prove that the homogenization trick, which embeds 
points in a union of affine subspaces into points in a union of linear subspaces, preserves the general position of the points and the 
transversality of the union of subspaces in the embedded space, thus establishing the correctness of ASC for affine subspaces. 

Index Terms —Algebraic Subspace Clustering, Affine Subspaces, Flomogeneous Coordinates, Algebraic Geometry. 
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1 Introduction 

Subspace clustering is the problem of clustering a collection of 
points drawn approximately from a union of linear or affine 
subspaces. This is an important problem in machine learning 
with many applications in computer vision and pattern recognition 
such as clustering faces, digits, images and motions (T]. Over 
the past 15 years, a variety of subspace clustering methods have 
appeared in the literature, including iterative il, 0, probabilistic 
a, algebraic 0, spectral 0, 0, and self-expressiveness-based 
0, 0, d, El, El approaches. Among them, the Algebraic 
Subspace Clustering (ASC) method of 0, also known as GPCA, 
establishes an interesting connection between machine learning 
and algebraic geometry (see also El for another such connec¬ 
tion). By describing a union of n linear subspaces as the zero 
set of a system of homogeneous polynomials of degree n, ASC 
clusters the subspaces in closed form via polynomial fitting and 
differentiation (or alternatively by polynomial factorization 11141 1. 
Merits of algebraic subspace clustering. In addition to pro¬ 
viding interesting algebraic geometric insights in the problem 
of subspace clustering, ASC is unique among other methods in 
that it is guaranteed to provide the correct clustering, under the 
mild hypothesis that the union of subspaces is transversal and 
the data points are in general position (in an algebraic geometric 
sense). This entails that ASC can handle subspaces of dimension d 
comparable to the ambient dimension D (high relative dimension 
d/D). In contrast, most state-of-the-art methods, such as Sparse 
Subspace Clustering (SSC) d, El, require the subspaces to 
be of sufficiently small relative dimension. Therefore, instances 
of applications where ASC is a natural candidate, while SSC is 
in principle inapplicable, are projective motion segmentation d, 
3D point cloud analysis El and hybrid system identification d- 
Moreover, it was recently demonstrated in d that, using the 
idea of filtrations of unions of subspaces 1201 . II21I . ASC not only 
can be robustified to noise, but also outperforms state-of-the-art 
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methods in the popular benchmark dataset Hopkins 155 1221 for 
real world motion segmentation. 

Dealing with affine subspaces. In several important applications, 
such as motion segmentation, the underlying subspaces do not 
pass through the origin, i.e., they are affine. Methods such as 
K-subspaces 0, 0 and mixtures of probabilistic PCA 0 can 
trivially handle this case by explicitly learning models of affine 
subspaces. Likewise, the spectral clustering method of El can 
handle affine subspaces by constructing an affinity that depends on 
the distance from a point to a subspace. However, these methods 
do not come with theoretical conditions under which they are 
guaranteed to give the correct clustering. On the other hand, when 
data X = [xi • ■ • aj^v] C lying in a union of n distinct affine 
subspaces are embedded into homogeneous coordinates 
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they lie in a union of n distinct linear subspaces of If 

the embedded data X satisfy the geometric separation conditions 
of El with respect to the underlying union of linear subspaces, 
then SSC El applied to X is guaranteed to yield a subspace 
preserving affinity. While the conditions in El have a clear 
geometric interpretation for linear subspaces, it is unclear what 
these conditions entail when applied to affine subspaces via the 
embedding in |[TJ. On the other hand, recent work II24I . 1251 shows 
that an version of SSC (^g-SSC) yields the correct clustering 
under mild conditions of general position (in a linear algebraic 
sense), and this analysis can be easily extended to affine subspaces. 
As opposed though to the polynomial complexity of £i-SSC ifTOt . 
^o-SSC has exponential complexity. While the ASC approach 
discussed here also has exponential complexity in general, under 
certain conditions it is more efficient than ^g-SSClj 


1. The worst-case complexity of ASC is O ), which is linear in 

the number of data points, N, and exponential in the number of subspaces, 
n, and the dimension of the original data D. In contrast, the worst-case 
complexity of fo-SSC is O (^N{D l){d 1)^ , where d is the 

maximal dimension of the affine subspaces. Hence, when n and D are small 
and d ss 0{D), the complexity of £o-SSC as a function of N dominates that 
of ASC. A detailed comparison of the complexities of ASC and fo’SSC is 
beyond the scope of the paper and is thus omitted. 
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Returning to ASC, the traditional way to handle points from a 
union of affine subspaces 1261 is to use homogeneous coordinates 
as in O, and subsequently apply ASC to the embedded data. We 
refer to this two-step approach as Affine ASC (AASC). Although 
AASC has been observed to perform well in practice, it lacks a 
sufficient theoretical justification. On one hand, while it is true that 
the embedded points live in a union of associated linear subspaces, 
it is obvious that they have a very particular structure inside these 
subspaces. Specifically, even if the original points are generic, in 
the sense that they are sampled uniformly at random from the 
affine subspaces, the embedded points are clearly non-generic, in 
the sense that they always lie in the zero-measure intersection of 
the union of the associated linear subspaces with the hyperplane 
Xq = 10 Thus, even in the absence of noise, one may wonder 
whether this non-genericity of the embedded points will affect the 
behavior of AASC and to what extent. Moreover, even if the affine 
subspaces are transversal, there is no guarantee that the associated 
linear subspaces are also transversal. Thus, it is natural to ask for 
conditions on the affine subspaces and the data points under which 
AASC is guaranteed to give the correct clustering. 

Paper contributions. In this paper we adapt abstract notions from 
algebraic geometry to the context of unions of affine subspaces in 
order to rigorously prove the correctness of AASC in the absence 
of noise. More specifically, we define in a very precise fashion the 
notion of points being in general position in a union of n linear or 
affine subspaces. Intuitively, points are in general position if they 
can be used to uniquely reconstruct the union of subspaces they lie 
in by means of polynomials of degree n that vanish on the points. 
Then we show that the embedding 0 preserves the property of 
points being in general position, which is one of the two success 
conditions of ASC. We also show that the second condition, which 
is the transversality of the union of linear subspaces in that 

is associated to the union of affine subspaces in under the 
embedding 0, is also satisfied, provided that 

1) the union of subspaces formed by the linear parts of the 
affine subspaces is transversal, and 

2) the translation vectors of the affine subspaces do not lie 
in the zero measure set of a certain algebraic variety. 

Our exposition style is for the benefit of the reader unfamiliar 
with algebraic geometry. We introduce notions and notations as 
we proceed and give as many examples as space allows. We leave 
the more intricate details to the various proofs. 

2 Algebraic Subspace Clustering Review 

This section gives a brief review of the ASC theory ((Sj, |27]| . t28]l . 
ini). After defining the subspace clustering problem in Section 
[Q we describe unions of linear subspaces as algebraic varieties 
in Section [2^ and give the main theorem of ASC (Theorem0 in 
terms of vanishing polynomials in Section [231 In Section |24| we 
elaborate on the main hypothesis of Theorem [TJ the transversality 
of the union of subspaces. In Section [Z3I we introduce the notion 
of points in general position (Definition [5j and adapt Theorem [T] 
to the more practical case of a finite set of points (Theorem[9]l. 

2.1 Subspace Clustering Problem 

Let X = {xi ,..., Xn} be a set of points that lie in an unknown 
union of n > 1 linear subspaces $ = IJfci where Si a linear 

2. Here and in the rest of the paper, we consider only the uniform measure. 


subspace of of dimension d^ < D. The goal of subspace 
clustering is to find the number of subspaces, their dimensions, a 
basis for each subspace, and cluster the data points based on their 
subspace membership, i.e., find a decomposition or clustering of 
X as X = Xi LI ■ ■ ■ U Xn, where Xi = X Cl Si. 

2.2 Unions of Linear Subspaces as Aigebraic Varieties 

The key idea behind ASC is that a union of n linear subspaces 
<I> = ULa ^ finite set of ho¬ 

mogeneous polynomials of degree n with real coefficients in 
D indeterminates x := [xi, ... ,xd\^ ■ Such a set is called 
an algebraic variety (H, 1301 . For example, a union of n 
hyperplanes $ = Hi U • ■ • U Hn, where the ith hyperplane 
T-Li = {x : bjX = 0 } is defined by its normal vector bi £ R^, 
is the zero set of the polynomial 

p{x) = {bjx){b 2 x)---{blx), ( 2 ) 

in the sense that a point x belongs to the union $ if and only if 
p{x) = 0. Likewise, the union of a plane with normal b and a line 
with normals bi , 62 in is the zero set of the two polynomials 

Pi{x) = {b^x){bjx) and P 2 {x) = (b^x){b 2 x). (3) 

More generally, for n subspaces of arbitrary dimensions, these 
vanishing polynomials are homogeneous of degree n. Moreover, 
they are factorizable into n linear forms, with each linear form 
defined by a vector orthogonal to one of the n subspaces 0 

2.3 Main Theorem of ASC 

The set X$ of polynomials that vanish at every point of a union 
of linear subspaces <!> has a special algebraic structure: it is 
closed under addition and it is closed under multiplication by 
any element of the polynomial ring TZ = ... ,xd]. Such 

a set of polynomials is called an ideal I29L 1301 of TZ. If we 
restrict our attention to the subset „ ofX$ that consists only of 
vanishing polynomials of degree n, we notice that X$ „ is a finite 
dimensional real vector space, because it is a subspace of TZn, the 
latter being the set of all homogeneous polynomials of TZ of degree 
n, which is a vector space of dimension Mn{D) := 

Theorem 1 (Main Theorem of ASC, 0). Let $ = ljr=i 
a transversal union of linear subspaces of K^. Let pi,... ,ps 
be a basis for X$ „ and let Xi be a point in Si such that Xi ^ 
[j^i^iS^i. Then Si = Span(Vpi|a;i, •. .,Vps\a,i)^. 

In other words, we can estimate the subspace Si passing 
through a point Xi, as the orthogonal complement of the span of 
the gradients of all the degree-n vanishing polynomials evaluated 
at Xi. Observe that the only assumption on the subspaces required 
by Theorem[T] is that they are transversal, a notion explained next. 

2.4 Transversal Unions of Linear Subspaces 

Intuitively, transversality is a notion of general position of sub¬ 
spaces, which entails that all intersections among subspaces are as 
small as possible, as allowed by their dimensions. Formally: 

3. A polynomial in many variables is called homogeneous if each of its 
monomials has the same degree. For example, + 0 : 10:2 is homogeneous of 
degree 2, while + X 2 is non-homogeneous of degree 2. 

4. Strictly speaking this is not always true; it is true though in the generic 
case, for example, if the subspaces are transversal (see Definition[3- 
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Definition 2 (' 1271 ). A union $ = Si of linear subspaces of 
is transversal, if for any subset 2of\n\ := {1, 2 ,..., n} 

codim n Si J = min Id,^2 codim((Si) > , (4) 

Vies / I ) 

where codim( 4 S) = D — dim( 4 S) denotes the codimension of S. 

To understand Definition [T] let J3i be a I? x Ci matrix 
containing a basis for S:^, where ct is the codimension of Si, 
and let -3 be a subset of [n], say 3 = {Ij • ■ ■ i £ tr. Then 

a point X belongs to HicT if only if = 0, where 

= [Bi ,..., B(]. Hence, the dimension of Hies is equal 
to the dimension of the left nullspace of B^j, or equivalently, 


i63 


codim ( n *^0 = rank(B3). 


(5) 


Since By is a B x J2i^y c* matrix, we must have that 


rank(B 3 ) < min |b, ^ a j . (6) 

Hence, transversality is equivalent to Bj being full-rank, as 3 
ranges over all subsets of [n], and By drops rank if and only if 
all maximal minors of By vanish, in which case there are certain 
algebraic relations between the basis vectors of S:^, *6 3- Since 
any set given by algebraic relations has measure zero ( 33 , this 
shows that a union of subspaces is transversal with probability 1. 

Proposition 3. Let $ = \Jii—i Si be a union ofn linear subspaces 
in of codimensions 0 < < D, i G [n]. Let bn, ..., bi^^ 

be a basis for S^. If the vectors {bij-}jLf^’'^' do not lie in the 
zero-measure set of a (proper) algebraic variety 
then $ is transversal. 


Example 4. Consider two planes <Si, <S 2 in with normals bi 
and b 2 . Then one expects their intersection Si H S 2 to be a line, 
and hence be of codimension 2 = min(3,1 -f 1), unless the two 
planes coincide, which happens only if bi is colinear with 62 - 
Clearly, if one randomly selects two planes in the probability 
that they are not transversal is zero. If we consider a third plane 
1 S 3 with normal by such that every intersection <Si H <S 2 , iSi H 1 S 3 
and S 2 n 1 S 3 is a line, then the three planes fail to be transversal 
only if iSi n 1 S 2 n 1 S 3 is a line. But this can happen only if the 
three normals 61 , 62 , by are linearly dependent, which again is a 
probability zero event if the three planes are randomly selected. 

This reveals the important fact that the theoretical conditions 
for success of ASC (in the absence of noise) are much weaker than 
those for other methods such as SSC, since as we just pointed out 
ASC will succeed almost surely (Theorem[T]i0 


2.5 Points In General Position 

In practice, we may not be given the polynomials pi,... ,ps that 
vanish on a union of subspaces <i> = ljr=i rather a finite 

collection of points X = {£Ci,...,£Cjv} sampled from <i>. If we 
want to fully characterize $ from X, the least we can ask is that 
X uniquely defines $ as a set, otherwise the problem becomes 
ill-posed. Since it is known that <i> is the zero set of (5), i-e., 
<i> = 2 ^(X$.„), it is natural to require that $ can be recovered 

5. Of course, the main disadvantage of ASC with respect to SSC is its 
exponential computational complexity, which remains an open problem. 


as the zero set of all homogeneous polynomials of degree n that 
vanish on X. 

Definition 5 (Points in general position). Let ^ be a union ofn 
linear subspaces ofVP, and X a finite set of points in $. We will 
say that X is in general position in <i>, if^ = Z{Xx,n)- 

Recall from Theorem [T] that for ASC to succeed, we need a 
basis pi,... ,ps for I^^n- The next result shows that if X is in 
general position in <I>, then we can compute such a basis form X. 

Proposition 6 . X is in general position in <i> Tx,n = 

Proof. (=4«) Suppose X is in general position in $, i.e., $ = 
Z[Xx,n)- We will show \ha.tXx,n = The inclusionD 
is immediate, since if p G X$,ra vanishes on <I>, then it will 
vanish on the subset X of <I>. Conversely, let p G Xx,n- Since by 
hypothesis $ = Z{Xx,n), we will have thatp(a:) = 0, fx G 4), 
i.e., p vanishes on $, i.e., p G i.e., Xx,n C X$,ra- 

(<;=) Suppose Xx,n = then Z(Xx,n) = - 2 '(X 3 >,ri). 

Since = Z(X$ „) (3, we have d> = Z(Xx,n)- □ 

Next, we show that points in general position always exist. 

Proposition 7. Any union ^ of n linear subspaces ofM.^ admits 
a finite subset X that lies in general position in $. 

Proof. This follows from Theorem 2.9 in II28I . together with the 
regularity result of 1321 . which says that the maximal degree of a 
generator of 1 $ does not exceed n. □ 

Example 8. Let ^ = Si U S 2 be the union of two planes of 
with normal vectors by, 62 . and let X = {xy, X2, Xy, X/y] be four 
points o/$, such that, Xy, X2 G Sy\ S 2 and Xy, Xy G S 2 \ 5i. 
Let TLyy and 'H 2 A be the planes spanned by a;i,a ;3 and X 2 ,X/y 
respectively, and let 613,624 be the nornmls to these planes. 
Then the polynomial q{x) = (bj^x){bjyx) certainly vanishes 
on X. But q does not vanish on $, because the only (up to a 
scalar) homogeneous polynomial of degree 2 that vanishes on $ 
is p{x) = (bj x){b 2 x). Hence X is not in general position in 
<i>. The geometric reasoning is that two points per plane are not 
enough to uniquely define the union of the two planes; instead a 
third point in one of the planes is required. 

In terms of a finite set of points X, Theorem [T] becomes: 

Theorem 9. Let X be a finite set of points sampled from a union 
<I> 0 / n linear subspaces of R^. Let py,... ,Ps be a basis for 
^X,n, the vector space of homogeneous polynomials of degree 
n that vanish on X. Let X; be a point in X; \= X C\ S; 
such that Xi ^ V ^ is in general position in <I> 

(Definition |3, and d> is transversal (Definition H, then Si = 
Span(Vpi ,..., Vps la; J-*-. 


3 Problem Statement and Contributions 

In this section we begin by defining the problem of clustering 
unions of affine subspaces in Section 13.11 In Section 13.21 we 
analyze the traditional algebraic approach for handling affine 
subspaces and point out that its correctness is far from obvious. 
Finally, in Section [53] we state the main findings of this paper. 
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3.1 Affine Subspace Clustering Problem 

Let X = {xi,... ,xn} be a finite set of points living in a union 
'I' = Ur=i ^ affine subspaces of R^. Each affine subspace 
Ai is the translation by some vector /Xj € R^ of a -dimensional 
linear subspace Si, i.e., Ai = Si + fii. The affine subspace 
clustering problem involves clustering the points X according to 
their subspace membership, and finding a parametrization of each 
affine subspace Ai by finding a translation vector /Xj and a basis 
for its linear part Si, for all x = 1,..., n. Note that there is an 
inherent ambiguity in determining the translation vectors /Xj, since 
if Ai = SiX fJbi, then Ai = iSi-|-(siH-/Xi) for any vector Si € Si. 
Consequently, the best we can hope for is to determine the unique 
component of in the orthogonal complement S^ of Si. 

3.2 Traditional Algebraic Approach 

Since the inception of ASC, the standard algebraic approach to 
cluster points living in a union of affine subspaces has been to 
embed the points into R^+^ and subsequently apply ASC I26II . 
The precise embedding (po : R^ ^ is given by 

a = {ai,..., an) oc = ao). (7) 

To understand the effect of this embedding and why it is mean¬ 
ingful to apply ASC to the embedded points, let A = 5 -|- /x be 
a d-dimensional affine subspace of R^, with Ui,... ,Ud being a 
basis for its linear part S. As noted in Section 13.11 we can also 
assume that /x C 5^. For x G A, there exists y G such that 


Definition!^, and ii) the union of linear subspaces $ is transversal. 
Note that these conditions need not be satisfied a priori because of 
the particular structure of both the embedded data in (|T]| and the 
basis in (|9]l. This gives rise to the following reasonable questions: 

Question 10. Under what conditions on X and If, will X be in 
general position in $ ? 

Question 11. Under what conditions on 'll will be transversal? 

3.3 Contributions 

The main contribution of this paper is to answer Ouestions IlOlllI 
Regarding Question[T0l one may be tempted to conjecture that 
X is in general position in $, if the components of the points 
X along the union $ := [jr=i of the linear parts of the 
affine subspaces are in general position inside <!>. However, this 
conjecture is not true, as illustrated by the next example. 

Example 12. Suppose that Ih = Ai U A 2 is a union of two affine 
planes Ai = Si + /Xj of R^. Then ^ = Si U S 2 is a union of 2 
planes in R^ and as argued in Example we can find 5 points 
in general position in $. However, $ = <Si U <S 2 A ci union of 2 
hyperplanes in R^ and any subset of $ in general position must 
consist of at least A 42 ( 4 ) — 1 = —1 = 9 points^ 

To state the precise necessary and sufficient condition for X to be 
in general position in $, we first show that Ih is the zero-set of 
non-homogeneous polynomials of degree n. 


x = Uy + pi, J7 := [mi, ... ,Md] G (8) 


Then the embedded point x := can be written as 


'I' 


1 


■ 1 0 • 

0 

X 

= u 

. y . 

, U := 

/X 1x1 • 

• Ud 


• (9) 


Equation ([9]l clearly indicates that the embedded point x lies in 
the linear (d-|- l)-dimensional subspace S := Span(i!7) of 
and the same is true for the entire affine subspace A. From @ one 
sees immediately that (iti,..., xx^, /x) can be used to construct a 
basis of S. The converse is also true: given any basis of S one can 
recover a basis for the linear part S and the translation vector /x 
of A. Hence, the embedding (po takes a union of affine subspaces 
T' = ljr=i into a union of linear subspaces < 1 > = IJILi 

in a way that there is a 1 — 1 correspondence between the 
parameters of Ai (a basis for the linear part and the translation 
vector) and the parameters of Si (a basis) for every i G [n]. 

To the best of our knowledge, the correspondence between Ai 
and Si has been the sole theoretical justification so far in the sub¬ 
space clustering literature for the traditional Affine ASC (AASC) 
approach for dealing with affine subspaces, which consists of 


1) applying the embedding fin to points X , 

2 ) computing a basis pi,... ,ps for the vector space 

of homogeneous polynomials of degree n that vanish on 
the embedded points X := fio(X), 

3) for Xi G X HiSi \|Jj^j/ Si, estimating Si via the formula 

Si = Span(Vpi|i,i,-..,Vps !*;)■'■, (10) 

4) and extracting the translation vector of Ai and a basis for 
its linear part from a basis of Si. 


Proposition 13. Let T' = \Si—i Ai be a union of affine subspaces 
o/R^, where each affine subspace Ai is the translation of a linear 
subspace Si of codimension Ci by a translation vector /Xj. For each 
Ai = Si X fii, let bii, ..., bic^ be a basis for S^. Then is the 
zero set of all degree-n polynomials of the form 

n 

n G [Ci] X ••• X [c„]. (11) 

i=l 

Thanks to Proposition! 1 31 we can define points X to be in general 
position in in analogy to Definition [5] 

Definition 14. Let ^ be a union ofn affine subspaces o/R^ and 
X a finite subset of iP. We will say that X is in general position 
in T', if T' can be recovered as the zero set of all polynomials of 
degree n that vanish on X. Equivalently, a polynomial of degree 
n vanishes on T' if and only if it vanishes on X. 

We are now ready to answer our Question 1 101 

Theorem 15. Let X be a finite subset of a union of n affine 
subspaces IP = UlLi '^here Ai = Si X Hi, with 

Si a linear subspace of R^ of codimension 0 < Ci < D. Let 
$ = ljr=i union ofn linear subspaces o/R^“*'^ induced 

by the embedding fio : R^ in ([7]i. Denote by X <Z ^ 

the image of X under fi^. Then X is in general position in if 
and only if X is in general position in Ip. 

Our second Theorem answers Question (TT] 

Theorem 16. Let Ih = IJfci ® union ofn affine subspaces 
ofM.^, with Ai = SiXHi and /Xj = Bitti, where Bi G is 

a basis for S^ with Ci = codim 5^. If^ = Ufci transversal 


According to Theorem the above process wifi succeed, if i) the g Otherwise one can fit a polynomial of degree 2 to the points, which does 
embedded points X are in general position in (in the sense of not vanish on <I>. 
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and ai,..., On do not lie in the zero-measure set of a proper 4 ALGEBRAIC GEOMETRY OF UNIONS OF AFFINE 
algebraic variet^ofW'^^ x • • • x then $ is transversal. SUBSPACES 


One may wonder if some of the /Xj can be zero and $ still be 
transversal. This depends on the Ci as the next example shows. 


Example 17. Let Ai = Span( 6 ii, -i- pL I be an affine line 

and A 2 = Span( 62 )'^ + M 2 affine plane o/R^. Suppose that 
$ = Span( 6 ii, bi 2 )'‘‘ U Span( 62 )^ is transversal. Then $ = 
sSi U S 2 is transversal if and only if the ma trix 

^>11 bi 2 62 


G (12) 


has rank 3 . But rank j = 3 , irrespectively of what the Mi are, 

simply because the ixi matrix B := [bn 612 62 ] is full rank (by 
the transversality assumption on $). Now let us replace the affine 
plane A 2 with a second affine line A 2 = Span(b 2 i, ^ 22 )^ +M2- 
Then <I> is transversal if and only if 

fj ._ ~bnMi “bnMi ~bj2M2 

L bii bi2 b2i b22 

has rank 4, which is impossible if both pi^, fi 2 are zero. 


GR'*"'"' (13) 


As a corollary of Theorems [9l [15] and [T^ we get the correctness 
Theorem of ASC for the case of affine subspaces. 


Theorem 18. Let 4^ = IJlLi be a union of affine subspaces of 
with Ai = iSi + Mi and pt^ = Bitti, where Bi G 
is a basis for with Ci = codim iSi. Let $ = <Si be the 

union of n linear subspaces ofM.^^^ induced by the embedding 
(f>o : ^ of (|7]i. Let X be a finite subset of T* and 

denote by X d ^ the image of X under (j)Q. Let pi,... ,ps be 
a basis for the vector space of homogeneous polynomials 

of degree n that vanish on X. Let at G H Ai \ lJi>i ^ad 
denote x = cj)Q{x). Define 

bfc := Vpfcjsi G R^+\ k = l,...,s, (14) 


and without loss of generality, let hi,... be a maximal linearly 
independent subset 0 / 61 ,..., bg. Define further ( 7 ^, 6 ^) G R x 
R-® and ( 7^1 Bi) G R^ x as 


bh =■ 


Ik 

bk 


, k = !,...,£ 


7 i := , Bi := [ 61 ,..., 6^]. 


(15) 

(16) 


If X is in general position in '!',$ = [JiLi '^i is transversal, 
and ai,..., a„ do not lie in the zero-measure set of a proper 
algebraic variety o/R°'^ X • • • X R°", then 

Ai = Span(Bi)-^ - 7i- (17) 

Remark 19. The acute reader may notice that we still need to 
answer the question of whether T* admits a finite subset X in 
general position, to begin with. This answer is affirmative: If 4' 
satisfies the hypothesis of Theorem \16\ then $ will be transversal, 
and so by Proposition 15 1 I X>t< is generated in degree < n, in which 
case the existence of X follows from Theorem 2.9 in M8\l . 


The rest of the paper is organized as follows: in Section |4| 
we establish the fundamental algebraic-geometric properties of 
a union of affine subspaces. Then using these tools, we prove 
in Section Theorems [T^ and [T^ The proof of Theorem [T^ is 
straightforward is thus omitted. 


7. The precise description of this algebraic variety is given in the proof of 
the Theorem in Section lSlll 


In Section 14.11 we describe the basic algebraic geometry of affine 
subspaces and unions thereof, in analogy to the case of linear 
subspaces. In particular, we show that a single affine subspace 
is the zero-set of polynomial equations of degree 1 , and a union 
th of affine subspaces is the zero-set of polynomial equations of 
degree n. In Section 14.21 we study more closely the embedding 
A S of an affine subspace A C R^ into its associated linear 
subspace S C R^^^ (see Section [T2] |. which will lead to a deeper 
understanding of the embedding Ih $ of a union of affine 
subspaces dt C R^ into its associated union of linear subspaces 
$ C R^^^. As we will see, is dense in $ in a very precise 

sense, and the algebraic manifestation of this relation (Proposition 
[STT i will be used later in Section lSTl to prove our Theorem [TSl 

4.1 Affine Subspaces as Affine Varieties 

Let A = S+pt be an affine subspace of R^ and let 61 ,..., 6 c be a 
basis for the orthogonal complement of S. The first important 
observation is that a vector x belongs to S if and only if a; _L 
bfc, Vfc = I,..., c. In the language of algebraic geometry this is 
the same as saying that S is the zero set of c linear polynomials: 

S = Z [bjx,... , x-.= [xi,...,xd\^ . (18) 

Definition 20. Let y be a subset of R^. The set Xy of poly¬ 
nomials p{xi,. . . , Xjj) that vanish on y, i.e., p{yi, . . . , yo) = 
0 , f[yi,...,yDV is called the vanishing ideal ofy. 

One may wonder if the linear polynomials bjx, i = 1,..., c, 
form some sort of basis for the vanishing ideal Xs of S. In fact 
this is true (see the appendix in (la for a proof) and can be 
formalized by saying that these linear polynomials are generators 
of Xs over the polynomial ring TZ = R[a:i, ... ,Xd]. This means 
that every polynomial that belongs to Xs can be written as a linear 
combination of bJx ,..., bJx with polynomial coefficients, i.e., 

p{x) = pi{x){bjx)-\ - \-pcix){bJx) (19) 

where ,..., pc are some polynomials in TZ. More compactly 

Xs = {bjx,... ,bjx), ( 20 ) 

which reads as Xs is the ideal generated by the polynomials 
blx,..., bjx as in (dill. The following important facjj will be 
used in Section lSTl to prove our TheoremllSI 

Proposition 21. The vanishing ideal Xs of a linear subspace S 
is always a prime ideal, i.e., if p,q are polynomials such that 
pq G Xs, then either p € Xs or q G Xs. 

Moving on, the second important observation is that at G A if and 
only if a; — /X G <S. Equivalently, 

xGA O bkAx — pt, Vfc = I,...,c ( 21 ) 

or in algebraic geometric terms 

A = Z (^ 6 ^x - bJpt,... ,bjx - bJpi}j . (22) 

In other words, the affine subspace A is an algebraic variety of R^. 
In fact, we say that A is an affine variety, since it is defined by 
non-homogeneous polynomials. To describe the vanishing ideal 

8. For a proof see Appendix C in ED 
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Ia of A, note that a polynomial p{x) vanishes on A if and only if 
p{x + pb) vanishes on S. This, together with ( 120b . give 

2a = {bjx-blp,,...,blx-blp}j. (23) 

Next, we consider a union 'h = IJfci affine subspaces 
Ai = iSi + /Xj, f G [n], of R^. We will prove Proposition 1131 
which describes 'P as the zero-set of non-homogeneous polyno¬ 
mials of degree n, showing that 'P is an affine variety of R^. 

Proof. (Proposition 1 13b Denote the set of all polynomials of the 
form ifTTb by V. First, we show that 'P C Z{V). Take a; G T'; we 
will show that x G Z{V). Since 'P = Ai U • ■ • U A„, x belongs 
to at least one of the affine subspaces, say x G Ai, for some i. For 
every polynomial p of V, there is a linear factor bj^. x — bj^. /Xj of 
p that vanishes on Ai and thus on x. Hence p itself will vanish on 
X. Since p was an arbitrary element of V, this shows that every 
polynomial of V vanishes on x, i.e., x G Z(V). 

Next, we show that Z{V) C 'P. Let x G Z(V); we will 
show that X G If a; is a root of all polynomials pij(x) = 
bJjX — bJj/j,i, then a: G Ai and we are done. Otherwise, one of 
these linear polynomials does not vanish on x, say pii(x) 0 . 
Now suppose that a; ^ By the above argument, for every affine 
subspace Ai there must exist some linear polynomial bj^x—bji^ii, 
which does not vanish on x. As consequence, the polynomial 

n 

p{x)= n “ biiP'i) ( 24 ) 

i=l 

does not vanish on x, i.e., p{x) 0. But because of the definition 
of V, we must have that p G V. Since x was selected to be 
an element of Z[V), we must have that p{x) = 0, which is a 
contradiction, as we just saw that p{x) 0. Consequently, the 
hypothesis that a; 0 'P, must be false, i.e., ZifP) C □ 

The reader may wonder what the vanishing ideal 2$ of 'P is 
and what is its relation to the linear polynomials whose products 
generate 'P, as in Proposition [T^ In fact, this question is still 
partially open even in the simpler case of a union of linear 
subspaces 1331 . m, l27l. As it turns out, is intimately related 
to where $ = [JiLi union of linear subspaces 

associated to under the embedding (j)Q of (|7]l. It is precisely this 
relation that will enable us to prove Theorem [15] and to elucidate 
it we need the notion of projective closure that we introduce next 0 


4.2 The Projective Closure of Affine Subspaces 


Let (j)o{A) be the image of A = S + fx under the embedding 
tpo : R^ ^ R^+^ in (|7|l. Let 5 be the {d + 1)-dimensional 
linear subspace of R^+^ spanned by the columns of U (see (j^). 
A basis for the orthogonal complement of S in R^+^ is 


bi 



-bjfx 

be 


(25) 


since codim(5i) = codim(iS), and the bi are linearly indepen¬ 
dent because the bi are. In algebraic geometric terms 

S = Z (bjx- {bJn)xo,...,bJx- {bjn)xo'j 

f~T ~T N T (^^^ 

= Z [bj^ X,... ,b^ x) , X := [xo,a:i,.. .,xd] ■ 


9. Of course, the notion of projective closure is a well-known concept in 
algebraic geometry; here we introduce it in a self-contained fashion in the con¬ 
text of unions of affine subspaces, dispensing with unnecessary abstractions. 


By inspecting equations ll22b and ( 126b . we see that every point of 
(j){) (A) satisfies the equations (126b of S. Since these equations are 
homogeneous, it will in fact be true that for any point x G ipo (A) 
the entire line of R^+^ spanned by x will still lie in S. Hence, 
we may as well think of the embedding (j)Q as mapping a point 
X G R^ to a line of To formalize this concept, we need 

the notion of projective space 1301 . (H: 

Definition 22. The real projective space is defined to be the set 
of all lines through the origin in Each non-zero vector a 

o/R^“'"^ defines an element [a] o/P^, and two elements [a], [/3] 
of P^ are equal in P^, if and only if there exists a nonzero 
A G R such that we have an equality a. = A/3 of vectors in 
For each point [a] G P^, we call the point a. G R^^^ a 
representative of [o;]. 

Now we can define a new embedding 0o • R^ R^i that 
behaves exactly as ^0 in except that it now takes points of R^ 
to lines of or more precisely, to elements of P^; 

(at, Q!2, • ■ ■, Oio) [( 1 , o;i,a2, - • ■ , old)]- ( 27 ) 

A point at of A is mapped by (fo to a line inside S, or more 
specifically, to the point [a;] of P^, whose representative x 
satisfies the equations ( 126b of S. The set of all lines of R^+t 
that live in S, viewed as elements of P^, is denoted by [5], i.e., 

[5] = |[a] G P^ : a G . (28) 

The representative a of every element [a] G [iS] satisfies by 
definition the equations ( 126b of S, and so [iS] has naturally the 
structure of an algebraic variety of P^, which is called a projective 
variety. We emphasize that even though the varieties S and [5] live 
in different spaces, R^+^ and P^ respectively, they are defined 
by the same equations. In fact, every algebraic variety y of R^+^ 
that is the unions of lines, which is true if and only if y is defined 
by homogeneous equations, gives rise to a projective variety 
of P^ defined by the same equations. 

Example 23. Recall from Section 12.21 that a union $ of linear 
subspaces is defined as the zero-set of homogeneous polynomials. 
Then $ gives rise to a projective variety [$] o/P^ defined by the 
same equations as $, which can be thought of as the set of lines 
through the origin in that live in $. 

Returning to our embedding 0o> to describe the precise connection 
between (j)Q (A) and [5] we need to resort to the kind of topology 
that is most suitable for the study of algebraic varieties 1301 . 01: 

Definition 24 (Zariski Topology). The real vector space R^ and 
the projective space P^ can be made into topological spaces, by 
defining the closed sets of their associated topology to be all the 
algebraic varieties in R^ and P^ respectively. 

We are finally ready to state without proof the formal algebraic 
geometric relation between 0o(A) and S: 

Proposition 25. In the Zariski topology, the set 4>o{A) is open 
and dense in [<S], and so [<S] is the closur^^ of fio{A) in P^. 

The projective variety [5] is called the projective closure of A: 
it is the smallest projective variety that contains (/)o(A). We now 
characterize the projective closure of a union of affine subspaces. 

10. It can further be shown that [5] = 0o(A) U [<S]: intuitively, the set that 
we need to add to 1^0 (A) to get a closed set is the slope [5] of A. 
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Proposition 26. Let 'I' = be a union of affine subspaces 

o/K^. Then the projective closure of'^ in P^, i.e., the smallest 
projective variety that contains 0o('I'), is 

n r n ’ 

[j[Si]= U‘5. 

where Si is the linear subspace of corresponding to Ai 

under the embedding fo of 0. 

The geometric fact that [$] C P^ is the smallest projective variety 
of P^ that contains 0 o('I')i manifests itself algebraically in 
being uniquely defined by and vice versa, in a very precise 
fashion. To describe this relation, we need a definition. 



Definition 27 (Homogenization - Dehomogenization). Let p S 
TZ = K[xi ,... ,xd] be a polynomial of degree n. The homoge¬ 
nization of p is the homogeneous polynomial 


Ah) _ n 
p — XqP 


/ ^ 

I ’ ’ 

\Xq Xq 



(30) 


of TZ = K[xo, Xi,..., Xo] of degree n. Conversely, if P € TZ 
is homogeneous of degree n, its dehomogenization is P[d) = 
P(l, Xi,..., xd), which is a polynomial ofTZ of degree < n. 

Example 28. Let P = XqXi +Xox| + X 1 X 2 X 3 be a homogeneous 
polynomial of degree 3. Its dehomogenization is the degree-3 
polynomial P{d) = Xi + x| + X 1 X 2 X 3 , and the homogenization 

of Pm »(Pmffi = (ii + i + = p- 

The next result from algebraic geometry is crucial for our purpose. 

Theorem 29 (Chapter 8 in ISOl). Let y be an affine variety of 
and let y be its projective closure in P^ with respect to the 
embedding (jjQ of (HHi. Let Xy ,Xy be the vanishing ideals ofy, y 
respectively. Then Xy = X^\ i.e., every element ofXy arises as 
a homogenization of some element ofXy, and every element ofXy 
arises as the dehomogenization of some element ofXy. 

We have already seen that <I> and [$] are given as algebraic 
varieties by identical equations. It is also not hard to see that the 
vanishing ideals of these varieties are identical as well. 

Lemma 30. Let <i> = UtLi union of linear subspaces of 

and let [$] = corresponding projective 

variety o/P^. Then X^ i.e., a degree-k homogeneous 

polynomial vanishes on $ if and only if it vanishes on [<!>]. 

As a Corollary of Theorem 1291 and Lemma we obtain the key 
result of this Section, which we will use in Section [5T] 


Proposition 31. Let T' = IJlLi union of affine subspaces 

ofM.^. Let $ = UiLi‘5* union of linear subspaces of 

associated to 'll under the embedding <j>Q of 0. Then 
is the homogenization ofXq,. 


5 Proofs OF Main Theorems 
5.1 Proof of Theorem [Tsl 

Suppose that X is in general position in 'k. We need to show 
that X is in general position in <!>. In view of Proposition]^ and the 
fact that it is sufficient to show that 

To that end, let P be a homogeneous polynomial of degree n in 


R[xo, Xi,..., xd] that vanishes on the points X, i.e., P 
Then for every point a = (1, ai,..., q;_d) of A”, we have 

P(q!) =P(1,q;i,...,q;£)) = P(d)(ai,..., an) = 0 , (31) 

that is, the dehomogenization P(^d) of P vanishes on all points 
of X, i.e., P(^d) C Px- Now there are two possibilities: either 

P(d) has degree n, in which case P = (P(^))^^\ or Pf^d) has 
degree strictly less than n, say n — k, k > 1, in which case P = 
4 If P(^d) has total degree n, by the general position 

assumption on X, Pf^d) must vanish on 'P. Then by Proposition]^ 

C P<s>,n, and so P G If degP(d) =n — k, k > 1, 

suppose we can find a linear form G = C x, that does not vanish 
on any of the Si, i G [n], and it is not divisible by xq. Then G(^d) 
will have degree 1 and will not vanish on any of the , z G [n]. 
Also (G(d)) P(d) has degree n and vanishes on X. Since X is 
in general position in 'P, we will have that {G)d)) P{d) vanishes 
on 'll. Then by Proposition]^ {P{d)Y^^ n- Since X^ = 

nr=i^Si we must have that {P{d))^^^ S Xj., Vz G [n]. Since 
Xg is a prime ideal (Proposition 12 111 and G ^ Xj , it must be the 

case that (P(d))^^^ C Xj., Vi G [n], i.e., (P(d)j^^^ G P^- But 

P = Xq (P(d))^^\ which shows that P G X$ 

It remains to be shown that there exists a linear form G non- 
divisible by Xq, that does not vanish on any of the Si. Suppose this 
is not true; thus if G = 6 ^x + axo is a linear form non-divisible 
by Xq, i.e., b yl 0, then G must vanish on some Si. In particular, 
for any non-zero vector b of R^, b^x = 6 ^x -|- Oxq must vanish 
on some Si. Recall from Section 13.21 that if Un,..., Uid^ is a 
basis for Si, the linear part of Ai = Si X pti, then 

^ O'" 0 ^ 22 ) 

l^i ttii • • • Uid^ 

is a basis for Si. Since b^x vanishes on Si, it must vanish on each 
basis vector of Si. In particular, b^Un = • • • = b^Uid^ = 0, 
which implies that the linear form 6 ^x, now viewed as a function 
on R^, vanishes on Si, i.e., b^x G Psi. To summarize, we have 
shown that for every 0 7 ^ b G R^, there exists an z G [n] 
such that 6 ^x G X 5 .. Taking b equal to the standard vector ei 
of R^, we see that the linear form xi must vanish on some Si, 
and similarly for the linear forms X 2 ,.. •, xd. This in turn means 
that the ideal m := (xi,..., xd) generated by the linear forms 
Xi,.. ., Xd, must lie in the union Ufci PSi- But it is known from 
Proposition 1.1 l(i) in I29II . that if an ideal a lies in the union of 
finitely many prime ideals, then the a must lie in one of these 
prime ideals. Applying this result to our case, we see that, since 
the Xsi are prime ideals, m C X 5 . for some z G [n]. But this says 
that for any vector in Si all of its coordinates must be zero, i.e.. 
Si = 0, which violates the assumption di > 0,, Vz G [n]. This 
contradiction proves the existence of our linear form G. 

(<;=) Now suppose that X is in general position in $. We need 
to show that X is in general position in 'P. To that end, let p be 
a vanishing polynomial of Ip of degree rz, then clearly p G Px- 
Conversely, let p G Xx of degree n. Then for each point a X 

0 = p{ol) =p{ai,...,aD) 

= p^^\l,ai,...,aD)=P^^\oi.), (33) 

i.e., the homogenization vanishes on X. By hypothesis X is 
in general position in $, hence ^P^ n- Then by Proposition 
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[3n the dehomogenization of must vanish on But notice 

that ( ) = V, and so p vanishes on 'I'. 

)(d) 

5.2 Proof of Theorem [TSl 

If bii,..., bic- is an orthonormal basis of <$/*, bn,, bic-, with 

bin ■- [bjp - bJj.BiOi]^ = [bjj. - ai{ji)]^, ji € [d], (34) 

is a basis for . Suppose that $ is not transversal. Then there is 
some index set C [n], say ^ = {1,..., i} , i < n, such that 

rank(.B 3 ) < min < Z? + 1, Ci > , where (35) 

I iG3 J 

Bci := [Si,..., , Bi := [bn,... ,bici]. (36) 

Since $ is transversal, either rank(JB 3 ) = D or rank(JB 3 ) = 
Ci =: c^. Suppose the latter condition is true, then cj < D. 
Then all columns of are linearly independent, which implies 
that the same will be true for the columns of Bj, and so 
rank(B 3 ) = c^. Since by hypothesis c;] < D, we have that 

codim Pi 5i = rank(.B 3 ) = min {D + 1, cj} , (37) 

which contradicts (in. Consequently, it must be the case that 
rank(JB 3 ) = D < c:j. Since By is a submatrix of By, we must 
have rankBj > D. On the other hand, because of (135b we must 
have rank(B 3 ) < D, and so rank(B 3 ) = D. Now, By is a 
(B + 1) y. Cy matrix, with its minimal dimension being {D + 1). 
Since its rank is D, it must be the case that all (B + 1) x (B + 
1) minors of By vanish. Taking the Laplace expansion of each 
such minor along its last row (the only place where the variables 
ai,..., a„ appear), we see that it is in fact a linear polynomial 
in the a^, with the coefficients being D x D minors of JB^. Since 
By has rank D, at least one of its D X D minors is nonzero, 
and so at least one of the {D + 1) X (D + 1) minors of By is a 
nonzero polynomial. Thus the algebraic variety Wy defined by the 
vanishing of all the minors of By is a proper subset of the space 
n”=i that parametrizes the {ai}. Finally, is transversal if 
and only if the {ai} lie in the (open and dense) complement of 
the proper algebraic variety W := U^cH ^3 IliLi 

6 Conclusions 

We established in a rigorous fashion the correctness of ASC in 
the case of affine subspaces. Using the technical framework of 
algebraic geometry, we showed that the embedding of points lying 
in general position inside a union of affine subspaces preserves the 
general position. Moreover, the embedding of a transversal union 
of affine subspaces will almost surely give a transversal union of 
linear subspaces. Future research will aim at optimal realizations 
of the embedding in the presence of noise, analyzing SSC for 
affine subspaces, and reducing the complexity of ASC. 
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